Lesson: Using Text Data Processing 
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Figure 40: Text Data Processing Architecture 





1. The Extraction, Transformation, and Loading (ETL) designer sets up TDP jobs using the 
DS Designer. 


2. Data Services accesses text content from sources such as surveys, notes fields in 
databases, or text files directly. Connectors to email or internet sources can also be built. 
As long as the content is in HTML, XML, or TXT we can process it. 


3. Optionally, the results of TDP can be passed to DQ for cleansing and normalization before 
being stored to a repository. From here, the results can be consumed either directly by an 
application or dashboard or via the BI semantic layer. 


Entity Types 
Supported Entity Types for Extraction 
e Who: people, job title, and social security numbers 
e What: companies, organizations, financial indexes, and products 
e When: dates, days, holidays, months, years, times, and time periods 


«e Where: addresses, cities, states, countries, facilities, internet addresses, and phone 
numbers 


e How Much: currencies and units of measure 
e Generic Concepts text data, global piracy, and so on 


e Language Support English, French, German, Simplified Chinese, Spanish, Japanese 
(concepts only), and over 25 other languages 


Predefined Extraction of Sentiments, Events, and Relationships 


Voice of Customer Rules 
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